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Abstract 

Answer-set programming (ASP) has emerged recently 
as a viable programming paradigm. We describe here 
an ASP system, DATALOG with constraints or DC, 
based on non-monotonic logic. Informally, DC theo- 
ries consist of propositional clauses (constraints) and 
of Horn rules. The semantics is a simple and nat- 
ural extension of the semantics of the propositional 
logic. However, thanks to the presence of Horn rules 
in the system, modeling of transitive closure becomes 
straightforward. We describe the syntax, use and im- 
plementation of DC and provide experimental results. 

General Info 

DC is an answer set programming (ASP) system 
(MT99) similar to propositional logic but extended to 
include Horn clauses. The semantics of DC is a natural 
extension of the semantics of propositional logic. The 
DC system is implemented in two modules, ground and 
dcs, which are written in the 'C programming language 
and compiled with gcc. There are approximately 2500 
lines of code for ground and approximately 1500 for 
dcs. DC has been implemented on both SUN SPARC 
and a PC running linux. 

A DC theory (or program) consists of constraints 
and Horn rules (DATALOG program). This fact mo- 
tivates our choice of terminology — DATALOG with 
constraints. We start a discussion of DC with the 
propositional case. Our language is determined by a 
set of atoms At. We will assume that At is of the form 
At = Ate U At Hi where Ate an d Atn arc disjoint. 

A DC theory (or program) is a triple T dc = 
(T c ,Th,Tpc)i where 

1. Tc is a set of propositional clauses ->ai V ... V ->a m V 
&i V . . . V b n such that all a, and bj are from At 



2. Th is a set of Horn rules a\ A . . . A a r< 
b G Atn and all are from At, 

3. Tpc is a set of clauses over At. 



c, 

b such that 



By At(T dc ), At c (T dc ) and At PC {T dc ) we denote the 
set of atoms from At, Ate and Atpc, respectively, that 
actually appear in T dc . 

With a DC theory T dc = (T C ,T H ,T PC ) we associate 
a family of subsets of Atc(T dc ). We say that a set 



M C Atc{T dc ) satisfies T dc (is an answer set of T dc ) if 

1. M satisfies all the clauses in Tc, and 

2. the closure of M under the Horn rules in Th, 
M c = LM(T H UM) satisfies all clauses in T PC (LM(P) 
denotes the least model of a Horn program P) . 

Intuitively, the collection of clauses in Tc can be 
thought of as a representation of the constraints of the 
problem, Horn rules in Th can be viewed as a mech- 
anism to compute closures of sets of atoms satisfying 
the constraints in Tc, and the clauses in Tpc can be 
regarded as constraints on closed sets (we refer to them 
as post-constraints) . A set of atoms M C Atc(T dc ) is a 
model if it (propositionally) satisfies the constraints in 
Tc and if its closure (propositionally) satisfies the post- 
constraints in Tpc- Thus, the semantics of DC retains 
much of the simplicity of the semantics of pro positio nal 
logic. DC was introduced by the authors in flETOC ). 



Applicability of the System 

DC can be used as a computational tool to solve search 
problems. We define a search problem n to be deter- 
mined by a set of finite instances, Du, such that for 
each instance / G Dn, there is a finite set Su(I) of all 
solutions to n for the instance /. For example, the prob- 
lem of finding a hamilton cycle in a graph is a search 
problem: graphs are instances and for each graph, its 
hamilton cycles (sets of their edges) are solutions. 

A DC theory T dc = (Tc,Th,Tpc) solves a search 
problem n if solutions to n can be computed (in 
polynomial time) from answer sets to T dc . Propo- 
sitiona l logic and stable logic programming (|MT9S| ; 
Nie9S ) are used as problem solving formalisms following 
the same general paradigm. 

We will illustrate the use of DC to solve search prob- 
lems by discussing the problem of finding a hamilton 
cycle in a directed graph. Consider a directed graph G 
with the vertex set V and the edge set E. Consider a 
set of atoms {hc(a,b): (a,b) G E}. An intuitive inter- 
pretation of an atom hc(a,b) is that the edge (a,b) is 
in a hamilton cycle. Include in Tc all clauses of the 
form ->hc(b, a) V ->hc(c, a), where a,b, c G V , b ^ c and 
(b,a), (c,a) € E. In addition, include in Tc all clauses 
of the form ->hc(a,b) V -ihc(a,c), where a,b,c G V, 



b 7^ c and (a, 6), (a, c) G D. Clearly, the set of preposi- 
tional variables of the form {hc(a, b): (a, b) G F}, where 
F C E, satisfies all clauses in Tc if and only if no two 
different edges in F end in the same vertex and no two 
different edges in F start in the same vertex. In other 
words, F spans a collection of paths and cycles in G. 

To guarantee that the edges in F define a hamil- 
ton cycle, we must enforce that all vertices of G are 
reached by means of the edges in F if we start in some 
(arbitrarily chosen) vertex of G. This can be accom- 
plished by means of a simple Horn program. Let us 
choose a vertex, say s, in G. Include in Th the Horn 
rules hc(s,t) — > vstd(t), for every edge (s,t) in G. In 
addition, include in Th Horn rules vstd(t),hc(t,u) — ► 
vstd(u), for every edge (t,u) of G not starting in s. 
Clearly, the least model of F U Th , where F is a sub- 
set of E, contains precisely these variables of the form 
vstd(t) for which t is reachable from s by a nonempty 
path spanned by the edges in F. Thus, F is the set of 
edges of a hamilton cycle of G if and only if the least 
model of F U Th, contains variable vstd(t) for every 
vertex t of G. Let us define Tpc = {vstd(t):t G V} 
and T ham (G) = (T c ,T ff ,T PC ). It follows that hamil- 
ton cycles of G can be reconstructed (in linear time) 
from answer sets to the DC theory T/ lam (G). In other 
words, to find a hamilton cycle in G, it is enough to 
find an answer set for Th am {G). 

This example illustrates the simplicity of the seman- 
tics of DC — it is only a slight adaptation of the seman- 
tics of propositional logic to the case when in addition 
to propositional clauses we also have Horn rules in the- 
ories. It also illustrates the power of DC to generate 
concise encodings. All known propositional encodings 
of the hamilton-cycle problem require that additional 
variables are introduced to "count" how far from the 
starting vertex an edge is located. Consequently, propo- 
sitional encodings are much larger and lead to inefficient 
solutions to the problem. 

Description of the System 

In this section we will discuss general features of DC. 
First, we will discuss the language for encoding prob- 
lems and give an example by showing the encoding of 
the hamiltonicity problem. Second, we will describe 
how we execute the DC system. Third, we will give 
some details concerning the implementation of des, our 
solver. Last we discuss the expressitivity of DC. 

In the previous section we described the logic of DC 
in the propositional case. Our definitions can be ex- 
tended to the predicate case (without function sym- 
bols). Each constraint is treated as an abbreviation of 
a set of its ground substitutions. This set is determined 
by the set of constants appearing in the theory and by 
additional conditions associated with the constraint ( 
we illustrate this idea later in this section). When con- 
structing predicate DC-based solutions to a problem n 
we separate the representation of an instance to n from 
the constraints that define H The representation of an 
instance of n is a collection of facts or the cxtcnsional 



database (EDB). The constraints and rules that define 
n will be referred to as the intensional database (IDB) 
and the language for writing the problem descriptions 
that constitute the IDB will be referred to as Ldc- The 
separation of IDB and EDB means only one predicate 
description of n is needed. 

The modules of DC, ground and des provide a com- 
plete system for describing and finding solutions to 
problems. An IDB in Ld c along with a specific EDB are 
the input to ground. A grounded propositional theory 
Tdc is output by ground and used as input to des. 

A problem description in Ld c defines predicates, de- 
clares variables, and provides a description of the prob- 
lem using rules. The predicates arc defined using types 
from the EDB. Similarly, the variables are declared 
using types from the EDB. The rules consist of con- 
straints, Horn rules and post-constraints. The con- 
straints and post-constraints use several constructs to 
allow a more natural modeling. These constructs could 
be directly translated to clauses. (We use them as 
shorthands to ensure the conciseness of encodings.) 

We present here a brief discussion of the constraints, 
Horn rules and post-constraint in Ld c and their mean- 
ings. Let PRED be the set of predicates occurring in 
the IDB. For each variable X declared in the IDB the 
range R(X) of X is determined by the EDB. 

Select(n, m, Y; Pl (X), . . . , Pi (X, Y))q{X, Y), where 
n, m are nonnegative integers such that n < m, q G 
PRED and p\, . . . ,Pi are EDB predicates or logical 
conditions (logical conditions can be comparisons of 
arithmetic expressions or string comparisons). The 
interpretation of this constraint is as follows: for ev- 
ery x G R(X) at least n atoms and at most m atoms 
in the set {q(x, y) : y G R(Y)} are true. 

Select(n, m, Y)q(X, Y), where n,m are nonnegative 
integers such that n < m, q G PRED. The inter- 
pretation of this constraint is as follows: for every 
x G R(X) at least n atoms and at most m atoms in 
the set {q(x,y) : y G R(Y)} are true. 

Select (n, m)qi(X), . . . , qj(X), where n, m are nonneg- 
ative integers such that n < m,qi, . . . ,qj G PRED. 
The interpretation of this constraint is as follows: for 
every x G R(X) at least n atoms and at most m 
atoms in the set {qi (x), . . . , qj(x)} are true. 

Certain choices of n, m in any of the Select constraints 
allow construction of even more specific constraints in 
Tdc- 

n = m exactly n atoms must be true. 
n = at most m atoms must be true, 
m = 999 at least n atoms must be true. 

NOT qi (X),..., qi (X), where q 1 ,...,q l G PRED. 
For every x G R(X) at least one atom in the set 
{<7i(x), . . . , (}i(x)} must be false. 

qi (X)\ . . . \ qi (X), where q u ...,qi G PRED. For 
every x G R(X) at least one atom in the set 



An example of a graph file used as EDB 

vtx(l). 

vtx(2). 

vtx(3). 

edge(l,3). 

edge(3,2). 



Figure 1: Format for EDB. 

{qi(x), . . . , qi(x)} must be true. 

Pi(X), . . . , Pi (X) -» gi (X)\ . . . \ gj (X), where 

Pi, ■ ■ ■ ,Pi, (ft, . . . , qj G PRED. For every x G R(X) 
if all atoms in the set {pi(x), . . . ,Pi(x)} are true then 
at least one atom in the set {q%(ot), . . . , qj(x)} must 
be true. (This constraint represents standard prepo- 
sitional logic clauses.) 

Pi{X), . . . , Pi (X) -> q% (X), . . -,qj(X), where 

Pi, ■ ■ ■ ,Pi, qi, ■ ■ ■ , qj G PRED. For every x G R(X) 
if all atoms in the set {pi(x), . . . ,pi(x)} are true then 
all atoms in the set {91(2), . . . , qj(x)} must be true. 

Horn pi(X), . . . ,p l {X) -> qi(X), . . .,qj(X), where 
Pi, . . . ,pi,qi, ...,qj G PRED. 

Methodology 

Here we show the encoding of a problem in DC. We will 
use the example of hamiltonicity of a graph which we 
discussed previously. Figure [l] shows the EDB format 
used by ground. This format is compatible to that used 
by smodels and others. However, we require that the 
EDB be in separate files from the IDB. The format for 
the EDB allows data to be entered as sets, ranges, or 
individual elements and constant values can be entered 
on the command line. 

The IDB provides a definition of the problem in Ld c - 
The IDB file has three parts. First a definition of the 
predicates, next the declaration of variables and last a 
set of constraints and Horn clauses. The types used in 
the IDB must be in the data file(s). For example, the 
only data types in the graph file (Fig. |l]) are vtx and 
edge. Thus the only data types which can be used in 
the IDB are vtx and edge. 

In the idbpred section of Fig. 0, we define two pred- 
icates, the vstd predicate and the he predicate. The vstd 
predicate has one parameter of type vtx and he has two 
parameters both of type vtx. 

The idbvar section of Fig. || declares two variables 
X, Y both of type vtx. 

The section containing the constraints, Horn clauses, 
and post-constraints is proceeded by the keyword id- 
brules (see Fig.||). The order in which the rules 
are entered is not important. The first constraint, 
Select(l, 1, Y; edge(X, Y))hc{X 1 Y)., ensures that each 
vertex has exactly one outgoing edge. The second con- 
straint, S elect (1, 1, X; edge(X, Y))hc(X, Y)., requires 
that each vertex has exactly one incoming edge. 



An example of a file used as IDB 
% comments begin with percent sign 
idbpred % section for defining predicates 
vstd (vtx). 
he (vtx, vtx). 

idbvar % section for declaring variables 
vtx X,Y. 

idbrules % rule section 
% constraints 

Select(l,l,Y;edge(X,Y)) hc(X,Y). 
Select(l,l,X;edge(X,Y)) hc(X,Y). 
% Horn rules 

Horn Forall(X,Y;X!=i,edge(X,Y)) 
vstd(X), hc(X,Y) -> vstd(Y). 
Horn Forall(X,Y;X==i,edge(X,Y)) 
hc(X,Y) -> vstd(Y). 
% post-constraints 

vstd(X). 



Figure 2: File showing IDB for the hamiltonicity of a 
graph. The types are from Fig. |l| 

The first Horn rule ranges over all X, Y such that 
edge(X, Y) G EDB and X ^ i where i is a con- 
stant used to initialize vstd(i). This rule requires both 
vstd(X) and hc(X, Y) to be true before we can assign 
the value true to vstd(Y). The second Horn rule only 
requires hc(X, Y) to be true before vstd{Y) is assigned 
value true; however, in this rule X is restricted to the 
value of i. 

The last line is a post-constraint that requires 
vstd(X) to be true for all X G R{X) ensuring that 
the cycle is closed. 

Running the system 

Here we will describe the steps for execution of the DC 
system. The first module of DC, ground, has as input 
a data file(s), a rule file and command line arguments. 
Output is the theory file which will be used as input 
to dcs. The theory is written to a file whose name is 
the catenation of the constants and file names given as 
command line arguments. The extension .tdc is then 
appended to the output file name. The command line 
arguments for ground: 

ground -r rf -d df [-c label=v] [-V] 

Required arguments 

-r rf is the file describing the problem. There must be 
exactly one rule file. 

-d df must be one or more files containing data that 
will be used to instantiate the theory. It is often 
convenient to use more than one file for data. 

Optional arguments 

-c This option allows use of constants in both data and 
rule files. When label is found while reading either 



file it is replaced by v. v can be any string that is 
valid for the data type. If label is to be used in a 
range then v must be an integer. For example, if the 
data file contains the entry queens[l..q]. then we 
can define the constant q with the option -c q=8. If 
more than one constant is needed then -c b=3 n—14 
defines both constants b,n using the -c option. 

-V The verbose options sends output to stdout during 
the execution of ground. This output may be useful 
for debugging of the data or rule files. 

For the example of hamiltonicity we could have a data 
file (see Fig. |l|) named l.gph and an IDB file (see Fig. 
^) named hep. The constant i is needed in the IDB file 
to initialize the first vertex in the graph. The command 
line argument would be: 

ground -r hep -d l.gph -c i=l 

The theory file produced would be named 
l_l.gph_hcp.tdc. 

The second module of the DC system, des, has as 
input the theory file produced by ground. A file named 
dcs.stat is created or appended with statistics concern- 
ing the results of executing des on the theory. The 
command line arguments are: 

des -f filename [-A] [-P] [-C] [-V] 

Required arguments 

-f filename is the name of the file containing a theory 
produced by ground. 

Optional arguments 

-A Prints the positive atoms for solved theories in read- 
able form. 

-P Prints the input theory and then exits. 

-C Counts the number of solutions. This information is 
recorded in the statistics file. 

-V Prints information during execution (branching, 
backtracking, etc). Useful for debugging. 

Discussion of des 

The DC solver uses a Davis-Putnam type approach, 
with backtracking, propagation and LookAHead to 
deal with constraints represented as clauses, select con- 
straints and Horn rules, and to search for answer sets. 
The LookAHead in DC is si milar to local processing 
performed in csat ( DABC96 ). However, we use differ- 
ent methods to determine how many literals to consider 
in the LookAHead phase. Other techniques, especially 
propagation and search heuristics, were designed specif- 
ically for the case of DC as they must take into account 
the presence of Horn rules in programs. 

Propagation consists of methods to reduce the theory. 
Literals which appear in the heads of Horn rules, lh S 
Ata require different interpretations. A literal is a lh 
if and only if it appears in the head of a Horn rule. 
We can not guess an assignment for lh rather it must 
be computed. We can only assign value true to lh if it 



appears in the head of a Horn rule for which all literals 
in the body of the Horn rule have been assigned the 
value true. If one or more literals in the body of a Horn 
rule have been assigned the value false then that rule 
is removed. If lh has not been assigned a value and 
all Horn rules in which lh appeared in the head have 
been removed then lh is assigned the value false. If we 
have a post-constraint that required a value be assigned 
to lh and the value cannot be computed then we must 
backtrack. 

Non Horn rules are constraints which must be satis- 
fied. These rules are identified by tags which indicate 
which method is needed to evaluate the constraint dur- 
ing propagation. 

The LookAHead procedure tests a number of literals 
not yet assigned a value. The LookAHead procedure is 
similar to the local processing procedure used for csat 
( DABC96 ). A literal is chosen, assigned the value true, 
then false using propagation to reduce and evaluate the 
resulting theories. If both evaluations of assignments 
result in conflicts then we return false and backtrack- 
ing will result. If only one evaluation results in conflict 
then we can assign the literal the opposite value and 
continue the LookAHead procedure. If neither evalua- 
tion results in conflict we cannot make any assignments, 
but we save information (number of forced literals and 
satisfied constraints) computed during the evaluations 
of the literal. 

The number of literals to be tested has been deter- 
mined empirically. It is obvious that if all unassigned 
literals were tested during each LookAHead it would 
greatly increase the time. However, if only a small num- 
ber of literals are to be tested during each LookAHead 
then they must be chosen to provide the best chance 
of reducing the theory. To choose the literals with the 
best chance of reducing the theory, we order the unas- 
signed literals based on a sum computed by totaling 
the weights of the unsatisfied constraints in which they 
appear. The constraint weight is based on both the 
length and type of constraint. The shorter the con- 
straint the larger the weight and when literals are re- 
moved the weight is recomputed. The literals are tested 
in descending order of the sum of constraints weights. 
Using this method we need to test only a very small 
number of literals during each LookAHead to obtain 
good results. 

At the completion of the LookAHead procedure, we 
use the information computed during the evaluation of 
literals to choose the next branching literal and its ini- 
tial truth assignment. 

Expressitivity 

The expressive power of DC is the same as that of logic 
programming with the stable-m odel sem antics. The fol- 
lowing theorem is presented in (ETOO) 

Theorem 1 The expressive power of DC is the same 
as that of stable logic programming. In particular, a 
decision problem U can be solved uniformly in DC if 
and only if H is in the class NP. 
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Table 1: Schur problem; times 
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Figure 3: Results of computing hamilton cycles;log 
scale 



Evaluating the System 

The DC system provides a language, L^ c , which facil- 
itates writing problem descriptions. Once an IDB is 
written in Ld c it can be used for any instance of the 
problem for which data, in the EDB format, is available 
or can be generated. It is possible to add constraints 
to IDB for a given problem when new requirements or 
constraints occur. The constructs used in Ld c allow for 
a natural description of constraints. Users need only 
know enough about a specific problem to be able to de- 
scribe the problem in L^c (there is a user's manual with 
examples). To help with programming in DC ground 
provides error messages and compiling information that 
are useful for debugging the IDB. 

Benchmarks 

The DC system has been executed using problems from 
NP, combinatorics, and planning. In particular, it has 
been used to compute hamilton cycles and colorings in 
graphs, to solve the A^-queens problem, to prove that 
the pigeonhole problem has no solution if the number 
of pigeons exceeds the number of holes, and to compute 
Schur numbers. 

The instances for computing hamilton cycles were ob- 
tained by randomly generating one thousand directed 
graphs with v = 30, 40..., 80 and density such that 
« 50% of the graphs contained hamilton cycles. 

The graphs for instances of coloring were one hundred 
randomly generated graphs for v — 50, 100, . . . , 300 
with density such that ~ 50% had solutions when en- 
code as 3-coloring. 



dcs 
csat 
smodels 



3-coloring: log scale 



150 200 250 

Number of Vertices 



Figure 4: Results on encodings of coloring problems;log 
scale 
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Figure 5: N-queens problem;log scale 



Comparison 

We have used the benchmarks to compare DC with sys- 
tems based on stable model semantics and satisfiabil- 
ity The performance of DC solver dcs was compared 
with smodels, a system for computing stable models of 
logic programs ( NS96 ), and cs at, a sys tems for test- 
ing propositional satisfiability (DABC96). In the case 
of smodels we used version 2.24 in c onjunc tion with 
the grounder lparse , version 0.99.41 flNic98p . The ex- 
tended rules (Sim99) allowed in the newer versions of 
smodels and lparse were used where applicable. The 
programs were all executed on a Sun SPARC Station 20. 
For each test we report the cpu user times for processing 
the corresponding propositional program or theory. We 
tested all three system using the benchmarks discussed 
in the previous section. 

Note that csat performs comparable to DC for pi- 
geonhole (see Fig. ]6|), N-queens (see Fig. ||) and col- 
oring (see Fig. 0). These are problems where DC en- 
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Table 2: Difference in size of theories for hamilton cycle 
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[Sim99] P. Simons. Extending the stable model se- 
mantics with more expressive rules. In M. Gelfond, 
N. Leone, and G. Pfeifer, editors, Proceedings of 5th 
International Conference, LPNMR '99, volume 1730 
of Lecture Notes in Artificial Intelligence, pages 305- 
316. Springer Verlag, 1999. 



codings do not use Horn rules (in the examples here 
only encodings for computing hamilton cycles use Horn 
rules). The closure properties of Horn rules allow for 
much smaller theories as shown in Table |[ The satis- 
fiability theories for computing hamilton cycles are so 
large that they were not practical to execute for over 
40 vertices (see Fig. ||). smodels performs much better 
than satisfiability solvers for computing the hamilton 
cycles although not as well as DC. The results for com- 
puting Schur numbers (see Table [j]) also show much 
better results for DC. 

Experimental results show that dcs often outper- 
forms systems based on satisfiability as well as systems 
based on non-monotonic logics, and that it constitutes 
a viable approach to solving problems in AI, constraint 
satisfaction and combinatorial optimization. We be- 
lieve that our focus on short encodings (see Fig. ^) is 
the key to the success of dcs. 
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